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<212 



> 379 

> DNA 



<:213> Drosophila Short Gastrulation Gene (SOG) 



< 400> 1 

atggeeaaea agctgaggaa 

6 0 

eeaeteetgg aaaggagetg 
120 

aaaaccagcc at agagaaca 
180 

'"tea tea teg ceeeaetget 
2 4 0 

gegcegetea tgttegagga 
:■ 0 0 

gaatgeeagc ttggeaaagt 
? 0 

c :aeeetteg gagttatgta 
c g e a t e g t z g e a e g : g t e r a 

a at g azecca z c z c ■ j t t g c c 

i-: ?ga t gt a g cett 3 ga tgt 

t.a:gctgcgt tgctaacggg 
o o 0 

at. :fta:acca ::ta:aat:: 

7 JO 



at cgaa 
ctgeca 
agcccc 
gat cgt 
gtccga 
tttgcg 
ct gcat 



:gcc atcgaatggg 



eacg gaggacgecg 

:atc ctgcgccaec 

-tgc ttgcgcccgg 

-acg ggcaggcggt 

zqa a z tggggtcca 

:aa g tgtgaatgtg 

gt gtgcgaa t atcaaaaacg 

eggaaaatge tgeaagaect 

ge z-zqt gec ^ a a tga a gagg 

::g:a::t:: tattt"tca 

gcagaatgtg gtggccaccg 



ccagcgccac 
cactggagcc 
tgagecaact 
tga eggaggg 
ccaaccgacc 
c-tggtatgc 
egg z gatacc 
agtgcccgcc 
gecccggcga 
aa g a gcgcaa 
a g g gt ga gga 
ccegttt zzt 



eggcacagta 
ccaagcgagc 
gagccacctg 
ccqc eggcat 
a g egg tea cc 
ggacttgggt 
caagaagegg 
ggccaaa t gc 
tcgaaacgat 
catgaaa cat 
aatgaagt cc 
gtt ccacaag 
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• 



aagaatctat actactcctt 
780 

ttcgttgatg atgcgggtgt 
840 

actctcagtg tctatcagaa 



ctacacctca tcgcgaatcg 

aatcctggag gagcatcaac 

tgccacgggc aagatctga 
879 




gtcgtccgcg tgccattcaa 
tggagaccac cttgcgcggc 
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<210> 2 
<211> 292 
<212> PRT 

<213> Drosophila Short Gastrulation Gene (SOG) 
<400> 2 

Met Ala Asn Lys Leu Arg Lys Ser Asn Ala lie Glu Trp Ala Thr Ala 
15 10 15 

Thr Gly Thr Val Pro Leu Leu Glu Arg Ser Cys Cys His Ser Glu Asp 
20 25 30 

Ala Ala Leu Glu Pro Gin Ala Ser Lys Thr Ser His Arg Glu Gin Ala 
35 40 45 

Pro lie Leu Arg His Leu Ser Gin Leu Ser His Leu Leu lie lie Ala 
50 55 60 

Gly Leu Leu lie Val Cys Leu Ala Gly Val Thr Glu Gly Arg Arg His 
65 70 75 80 

Ala Pro Leu Met Phe Glu Glu Ser Asp Thr Gly Arj Arg Ser Asn Arg 

8 5 90 9 5 

Pro Ala Val Thr Glu Cys Gin Phe Gly Lys Val Leu Arg Glu Leu Gly 
1 0 0 1 0 5 1 1 0 

Ser Thr Trp Tyr Ala Asp Leu Gly Pro Pro Phe Gly Val Met Tyr Cys 
115 120 125 

lie Lys Cys Glu Cys Val Ala lie Pro Lys Lys Arg Arg lie Val Ala 
130 135 140 

Arq VaJ Gin Cys Ara Asn lie Lys Asn Glu Cys Pro Pro Ala Lys Cys 
14 5 " 150 15 5 160 

Asp Asp Pre lie Ser Leu Pro Gly Lys Cys Cys Lys Thr Cys Pro Gly 

165 17 0 175 

Asp Arg Asn Asp Thr Asp Val Ala Leu Asp Val Pro Val Pro Asr. Glu 

1 8 0 1 1 5. 1 j 0 

Glu Glu Glu Arg Asn Met Lys His Tyr Ala Ala Leu Leu Thr Gly Arg 
135 ' 200 " 205 

Thr Ser Tyr Phe Leu Lys Gly Glu Glu Met Lys Ser Met Tyr Thr Thr 
210 215 220 

Tyr Asn Pro Gin Asn Val Val Ala Thr Ala Arg Phe Leu Phe His Lys 
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225 230 235 240 

Lys Asn Leu Tyr Tyr Ser Phe Tyr Thr Ser Ser Arg lie Gly Arg Pro 

245 250 255 

Arg Ala He Gin Phe Val Asp Asp Ala Gly Val He Leu Glu Glu His 
260 265 270 

Gin Leu Glu Thr Thr Leu Ala Gly Thr Leu Ser Val Tyr Gin Asn Ala 
275 280 285 

Thr Gly Lys He 
290 
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<210> 3 
<211> 978 
<212> DNA 

<213> Drosophila Short Gastrulation Gene (SOG) 
<400> 3 

atggccaaca agctgaggaa atcgaacgcc atcgaatggg ccacggccac cggcacagta 
60 

ccgctcctgc aaaggagctg ctgccacagc gaggacgccg cactggagcc ccaagcgagc 
120 

aaaaccagcc atagagaaca agcccccatc ctgcgccacc tgagccaact gagccacctg 
180 

ctcatcatcg ccggactgct gatcgtctgc ttggcgggcg tgacggaggg ccgccggcat 
240 

cgcccgctca tgttcgagga gtccgacacg ggcaggcggt ccaaccgacc agcggtcacc 
300 

aaatgccagt ttggcaaagt tttgcgcgaa ttggggtcca cctggtatgc ggatttgggt 
360 

ccacccttcg gagttatgta ctgcatcaag tgtgaatgtg tggcgatacc caagaagcgg 
420 

ccrcatcgttg cacgcgtcca gtgrgcgaat atcaaaaacg agtgcccgcc ggccaaatgc 
480 

aatgatccca tctcgttgcc cggaaaatgc tgcaagacct gtcccggcga tcgaaacgat 
54 0 

acggatgtag ccz tggatgt gcccgtgccc aatgaagagg aagagcgcaa catgaaacat 
60 0 

tacgctgcgt tgctaacggg ccgcacctcc tat tree tea agggtgagga aatgaagtcc 

6 6 0 

atqtacacca cctacaatcc gcagaatgtg gtggccaccg cccgtttcct gttccacaag 
72 0 

aaaaatctat actactcctt ctacacctca tegegaateg gtcgtccgcg tgccattcaa 

7 3 0 

ttcgtrgatg atgcgggtgt aaccctggag gagcatcaac tggagaccac ettggeggge 
y 4 0 

actctcagtg tctat :agaa tgccacgggc aagateggee geggctcgag ggtacctcta 

': 1 0 C 1 

gaggatcttt grgaaggaac 3t tact tct g tggtgt ga :a taattggaca aactacctac 
'? 6 0 

agagatttaa agctctaa 
■=♦7 8 
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<210> 4 
<211> 325 
<212> PRT 

<213> Drosophila Short Gastrulation Gene (SOG) 
<400> 4 

Met Ala Asn Lys Leu Arg Lys Ser Asn Ala He Glu Trp Ala Thr Ala 
15 10 15 

Thr Gly Thr Val Pro Leu Leu Glu Arg Ser Cys Cys His Ser Glu Asp 
20 25 30 

Ala Ala Leu Glu Pro Gin Ala Ser Lys Thr Ser His Arg Glu Gin Ala 
35 40 45 

Pro He Leu Arg His Leu Ser Gin Leu Ser His Leu Leu He He Ala 
50 55 60 

Gly Leu Leu He Val Cys Leu Ala Gly Val Thr Glu Gly Arg Arg His 

6 5 7 0 7 5 8 0 

Ala Pro Leu Met Phe Glu Glu Ser Asp Thr Gly Arg Arg Ser Asn Arg 

8 5 '90 9 5 

Pre- Ala Val Thr Glu Cys Gin Phe Gly Lys Val Leu Arg Glu Leu Gly 
100 105 110 

Ser Thr Trp Tyr Ala Asp Leu Gly Pro Pro Phe Gly Val Met Tyr Cys 
115 120 125 

He Lys Cys Glu Cys Val Ala He Pro Lys Lys Arg Arg He Val Ala 
130 135 140 

Arg Val Gin Cys Arg Asn He Lys Asn Glu Cys Pro Pro Ala Lys Cys 
145 150 155 160 

Asp Asp Pro lie Ser Leu Pro Gly Lys Cys Cys Lys Thr Cys Pro Gly 

165 170 17 5 

Asp Arg Asn Aso Thr Asp Val Ala Leu Asp Val Pro Val Pro Asn Glu 
160 135 190 

Glu Glu Glu Arg Asn Mot Lys His Tyr Ala Ala Leu Leu Thr Gly Arg 

1 9 5 2 0 0 2 0 5 

Thr Ser Tyr Phe Leu Lys Gly Glu Glu Met Lys Ser Met Tyr Thr Thr 
210 215. 2 2 0 

Tvr Asn Pro Hn Asn Val Val Ala Thr Ala Arg Phe Leu Phe His Lys 
225. 230 235 240 
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Lys Asn Leu Tyr Tyr Ser Phe Tyr Thr Ser Ser Arg lie Gly Arg Pro 

245 250 255 

Arg Ala He Gin Phe Val Asp Asp Ala Gly Val He Leu Glu Glu His 
260 265 270 

Gin Leu Glu Thr Thr Leu Ala Gly Thr Leu Ser Val Tyr Gin Asn Ala 
275 280 285 

Thr Gly Lys He Gly Arg Gly Ser Arg Val Pro Leu Glu Asp Leu Cys 
290 295 300 

Glu Gly Thr Leu Leu Leu Trp Cys Asp He He Gly Asn Thr Thr Tyr 
305 310 315 320 

Arg Asp Leu Lys Leu 

325 
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<210> 5 
<211> 978 
<212> DNA 

<213> Drosophila Short Gastrulation Gene (SOG) 
<400> 5 

atggccaaca agctgaggaa atcgaacgcc atcgaatggg ccacggccac cggcacagta 
60 

ccgctcctgg aaaggagctg ctgccacagc gaggacgccg cactggagcc ccaagcgagc 
120 

aaaaccagcc atagagaaca agcccccatc ctgcgccacc tgagccaact gagccacctg 
180 

ctcatcatcg ccggactgct gatcgtctgc ttggcgggcg tgacggaggg ccgccggcat 
240 

gcgccgctca tgttcgagga gtccgacacg ggcaggcggt ccaaccgacc agcggtcacc 

30 0 

aaatgccagt ttggcaaagt tttgcgcgaa ttggggtcca cgcagtatgc ggatttgggt 
360 

ccacccttcg gagttatgta ctgcatcaag tgtgaatgtg tggcgatacc caagaagcgg 
420 

cgcatcgttg cacgcgtcca gtgtcgcaat atcaaaaacg agtgcccgcc ggccaaatgc 

4 * 0 

aatgatccca tctcgttgcc cggaaaatgc tgcaagacct gtcccggcga tcgaaacgat 
54 0 

acggargtag ccttggatgt gcccgtgccc aatgaagagg aagagcgcaa catgaaacat 
6 0 0 

tacgctgcgt tgctaacggg ccgcacctcc tatttcctea agggtgagga aatgaagtcc 

6 60 

atgtacacca cctacaatcc gcagaatgtg gtggccaccg cccgtttcct gttccacaag 
720 

aagaatctat actac:cctt ctacacccca tcgcgaatcg gtcgtccgcg tgccattcaa 

7 3 0 

tccgttgatg atgcgggtgt aatcctggag gagcatcaac tggagaccac cttggcgggc 
3 4 0 

actctcagtg tctatcagaa tgccacgggc aagatcggcc gcggctcgag ggtacctcta 

9 0 0 

gaggat-ttt gtgaaggaac ct t a 3t t ct g tggtgtga^a taattggaca aactacctac 
9 6 0 

agagatttaa ag ct c t aa 
97 8 
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<210> 6 
<211> 325 
<212> PRT 

<213> Drosophila Short Gastrulation Gene (SOG) 
<400> 6 

Met Ala Asn Lys Leu Arg Lys Ser Asn Ala lie Glu Trp Ala Thr Ala 
15 10 15 

Thr Gly Thr Val Pro Leu Leu Glu Arg Ser Cys Cys His Ser Glu Asp 
20 25 30 

Ala Ala Leu Glu Pro Gin Ala Ser Lys Thr Ser His Arg Glu Gin Ala 
35 40 45 

Pro lie Leu Arg His Leu Ser Gin Leu Ser His Leu Leu lie lie Ala 
50 55 60 

Gly Leu Leu lie Val Cys Leu Ala Gly Val Thr Glu Gly Arg Arg His 

70 75 80 



Ala Pre Leu Met Phe Glu Glu Ser Asp Thr Gly Arg Arg Ser Asn Arg 

85 90 95 

Pro Ala Val Thr Glu Cys Gin Phe Gly Lys Val Leu Arg Glu Leu Gly 
100 105 110 

Ser Thr Ala Tyr Ala Asp Leu Gly Pro Pro Phe Gly Val Met Tyr Cys 
115 120 125 

He Lys Cys Glu Cys Val Ala He Pro Lys Lys Arg Arg He Val Ala 
130 135 140 

Arc Val Gin Cys Arg Asn He Lys Asn Glu Cys Pro Pro Ala Lys Cys 
145 ~ 15C 155 160 

A so Asp Pro lie Ser Leu Pro Gly Lys Cys Cys Lys Thr Cys Pro Gly 

165 170 175 

Asp Arq Asn Asp Thr Asp Val Ala Leu Asp Val Pro Asp Pro Asn Glu 
180 185 190 

Glu Glu Glu Arg Asn Met Lys His Tyr Ala Ala Leu Leu Thr Gly Arg 
195 " 2 00 20 5 

Thr Ser Tyr- Phe Leu Lys Gly Glu Glu Met Lys Ser Met Tyr Thr Thr 
210 215. 2 2 0 

Tyr Asn Pro Hn Asn Val Val Ala Thr Ala Arg Phe Leu Phe His Lys 
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225 



230 



235 



240 



Lys Asn Leu Tyr Tyr Ser Phe Tyr Thr Ser Ser Arg lie Gly Arg Pro 

245 250 255 

Arg Ala He Gin Phe Val Asp Asp Ala Gly Val He Leu Glu Glu His 
260 265 270 

Lys Leu Glu Thr Thr Leu Ala Gly Thr Leu Ser Val Tyr Gin Asn Ala 
275 280 285 

Thr Gly Lys He Gly Arg Gly Ser Arg Val Pro Leu Glu Asp Leu Cys 
290 295 300 

Glu Gly Thr Leu Leu Leu Trp Cys Asp He He Gly Asn Thr Thr Tyr 
305 310 315 320 



Arg Asp Leu Lys Leu 

325 
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<210> 7 
<211> 1041 
<212> DNA 

<213> Drosophila Short Gastrulation Gene (SOG) 
<400> 7 

atggccaaca agctgaggaa atcgaacgcc atcgaatggg ccacggccac cggcacagta 
60 

ccgctcctgg aaaggagctg ctgccacagc gaggacgccg cactggagcc ccaagcgagc 
120 

aaaaccagcc atagagaaca agcccccatc ctgcgccacc tgagccaact gagccacctg 
180 

ctcatcatcg ccggactgct gatcgtctgc ttggcgggcg tgacggaggg ccgccggcat 
24 0 

gcgccgctca tgttcgagga gtccgacacg ggcaggcggt ccaaccgacc agcggtcacc 
3 0 0 

qaatgccagt ttggcaaagt tttgcgcgaa ttggggtcca cctggtatgc ggatttgggt 

3 60 

ccacccttcg gagttatgca ctgcatcaag tgtgaatgtg tggcgatacc caagaagcgg 
420 

ccrcatcgttg cacgcgtcca gtgtcgcaat atcaaaaacg agtgcccgcc ggccaaatgc 

4 6 0 

gatgatccca tctcgrtgcc cggaaaatgc tgcaagacct gtcccggcga tcgaaacgat 
i40 

acggatgtag ccttggatgt gcccgtgccc aatgaagagg aagagcgcaa catgaaacat 
60 0 

tacgctgcgt tgctaacggg cgcgacctcc tatttcctca agggtgagga aatgaagtcc 

6 60 

atqtacacca cctacaatcc gcagaatgtg gtggccaccg cccgtttcct gttccacaag 
72 0 

aaqaatctac ac:actcct: czacacczca tcgcgaatcg gtcgtccgcg tgccattcaa 

7 8 0 

"rg—gatg atgcgggtgt aarccrggag gagcatcaac tggagaccar cttggcgggc 
y 4 0 

actctcagtg tc:atcagaa tgccacgggc aagatcggcc gcggctcgag gcagcgcggc 
9 0 0 

cgcatctttt aczzazac^a tgttcctgac ta^gcgggct atccctatga cgtcccggac 
9 6 C' 

tatgcaggat cc:atccata tgacgtt rca gatitacgctg ctcagtgcgg ccgcgattat 
102 0 

a-jggacgacg acgacaaatc a 
1041 
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<210> 8 
<211> 346 
<212> PRT 

<213> Drosophila Short Gastrulation Gene (SOG) 
<400> 8 

Met Ala Asn Lys Leu Arg Lys Ser Asn Ala lie Glu Trp Ala Thr Ala 
15 10 15 

Thr Gly Thr Val Pro Leu Leu Glu Arg Ser Cys Cys His Ser Glu Asp 
20 25 30 

Ala Ala Leu Glu Pro Gin Ala Ser Lys Thr Ser His Arg Glu Gin Ala 
35 40 45 

Pro lie Leu Arg His Leu Ser Gin Leu Ser His Leu Leu lie lie Ala 
50 " 55 60 

Gly Leu Leu lie Val Cys Leu Ala Gly Val Thr Glu Gly Arg Arg His 
65 70 75 80 

Ala Pro Leu Met Phe Glu Glu Ser Asp Thr Gly Arg Arg Ser Met Arg 

8 5 90 " 95 

Pro Ala Val Thr Glu Cys Gin Phe Gl.y Lys Val Leu Arg Glu Leu Gly 
100 10 5 110 

Ser Thr Trp Tyr Ala Asp Leu Gly Pro Pro Phe Gly Val Met Tyr Cys 
115 120 125 

He Lys Cys Glu Cys Val Ala He Pro Lys Lys Arg Arg He Val Ala 
130 135 140 

Arg Val Gin Cys Arq Asn He Lys Asn Glu Cys Pro Pro Ala Lys Cys 
145 " 150 155 160 

Asp Asp Pro He Ser Leu Pro Gly Lys Cys Cys Lys Thr Cys Pro Gly 

165 170 175 

Asp Arq Asn Asp Thr Asp Val Ala Lou Asp Val Pro Val Pro Asn Glu 
18 0 IS 5 190 

Hu Glu Glu Arg Asn Met Lys His Tyr Ala Ala Leu Leu Thr Gly Arg 

19 5 2 0 0 2 0 5 

Thr Ser Tyr Phe Leu Lys Gly Glu Glu Met Lys Ser Met Tyr Thr Thr 
210 215 220 

Tyr Asn Pro Gin Asn Val Val Ala Thr Ala Arg Phe Leu Phe His Lys 
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225 230 235 240 

Lys Asn Leu Tyr Tyr Ser Phe Tyr Thr Ser Ser Arg lie Gly Arg Pro 

245 250 255 

Arg Ala He Gin Phe Val Asp Asp Ala Gly Val He Leu Glu Glu His 
260 265 270 

Gin Leu Glu Thr Thr Leu Ala Gly Thr Leu Ser Val Tyr Gin Met Ala 
275 280 285 

Thr Gly Lys He Gly Arg Gly Ser Arg Asn Arg Gly Arg He Phe Tyr 
290 295 300 

Pro Tyr Asp Val Pro Asp Tyr Ala Gly Tyr Pro Tyr Asp Val Pro Asp 
305 310 315 320 

Tyr Ala Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ala Asn Cys 

325 330 335 

Gly Arg Asp Tyr Lys Asp Asp Asp Asp Lys 
340 345 



<210> 9 
<211> 213 
<212> PRT 

<213> orosophila Short Gas t rulat ion Gene (SOG) 
<400> 9 

Met Asp His Ser Gin Cys Leu Val Thr lie Tyr Ala Leu Met Val Phe 
1 ^ 5 10 15 

Leu Gly Leu Arg lie Asp Gin Gly Gly Cys Gin His Tyr Leu His He 

2 0 2 5 3 0 

Ar: Pro Ala Pro Ser Glu Asn Leu Pro Leu Val Asp Leu He Glu His 
3 5 4 0 4 5 

?r:. Asp Pro Lie Tyr Asp Pro Lys Glu Lys Asp Glu Leu Asn Glu Thr 
5 0 5 5 60 

Leu Leu Ara Thr Leu Met Val Gly His Phe Asp Pro He Leu Pro Glu 
65 " 7 0 7 5 8 0 

Glu Arg Leu Gly VH Glu Asp Leu Gly Glu Leu Asp Leu Leu Leu Arg 

8 5 90 9 5 

Gin Lys Pro Ser Sly Ala Met Pro Ala Glu He Lys Gly Leu Glu Phe 
100 105 110 
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Tyr Glu Gly Leu Gin Ser Lys Lys His Arg Leu Ser Lys Lys Leu Arg 
115 120 125 

Arg Lys Leu Gin Met Trp Leu Trp Ser Gin Thr Phe Cys Pro Val Leu 
130 135 140 

Tyr Thr Trp Asn Asp Leu Gly Arg Tyr Val Lys Val Gly Ser Cys Tyr 
145 150 155 160 

Ser Lys Arg Ser Cys Ser Val Pro Glu Gly Met Val Cys Lys Ala Ala 

165 170 175 

Lys Ser Met His Leu Thr lie Leu Arg Trp Arg Cys Gin Arg Arg Val 
130 185 190 

Gin Gin Lys Cys Ala Trp He Thr He Gin Tyr Pro Val He Ser Glu 
195 200 205 

Cys Lys Cys Ser Cys 
210 



<210> 10 
<211> 53 
<212> PRT 

<213> Drosophila Short Gas"Crulation Gene (SOG) 
<400> 10 

Val Asp Leu He Glu His Pro Asp Pro He Tyr Asp Pro Lys Glu Lys 
1 5 10 15 

Asp Leu Asn Glu Thr Leu Leu Arg Thr Leu Met Val Gly His Phe Asp 
2 0 2 5 30 

Pro He Leu Pro Glu Glu Arg Leu Gly Val Glu Asp Leu Gly Glu Leu 
35 "40 4 5 

Asp Leu Leu Leu Arg 
50 
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<210> 11 
<211> 4892 
<212> DNA 

<213> Short Gastrulation Gene (SOG) 
<400> 11 

atattagatg gacatgcata ataattattc atgtaactat gtgattttca ttttacacga 
60 

ggtqtcagtc agaatttaaa attcttaaaa attgcaatca cggtctattg tacatattta 
120" 

tgtatgcgat ctcattatta ttattattat tatttgataa tatattagca gctgttcaaa 
180 

tcatcaacaa gtataaaatt gaaaatgtta aaaattgaag cagtcttggg tttgcctgca 
240 

acatgttgct gccgatcgtt agatgttgct gctgeatgnt gccgctgcat gttgccgttg 
30 0 

catgttgccg ccgtttggca actttataaa cacggagcgg attcagtcgt tcaggttcag 
360 " 

tctctcttga attgcgtggg attgcacatc ggtcgttcgg ctttttgggt tcggcattta 
420 

gagagatacg acacgatccg a tacgatccg atccagtaca aaaatcaaat tcaaaccgca 
4 60 

ctcccgatcc ggt cgccatc atatacacgg eggctcgcac cgcagctatc tagtagataa 
34 0 

aaagt cgccg agaacgcagt acgcatat ag cgaaaegcca aaaaaataaa aaaaaaagtc 
6 0 0 

gcgttgtcgt aatccctcct cataegagat cgactetatt ttccagagca agctaaacac 
6 6 0 

actagtgcta aaccataact atatatctaa ctaaggaaaa caaagtctcg aaactgaaaa 
72 0 

cqaaaagegc aaattratgc agccgctaaa taaaaacaga aaccaaaaca taaaacacac 
730 

tatacaaarc ata :aaaaca aaaacagcga at caaatagt ataaaaaaaa aataaataaa 
34 0 

tgaggaataa aaaaacaggc aaatagaata aatccaataa atcggcgcgc gaaacrccgc 
3 0 0 

tgtgttatct aatctgcaag agaagtacaa gaarcgggca tagaatcggc t eta tact at 
3 6 0 

atctatacac ctgatatatc tatarccatt gtgtgrgcca gcgtgtgcgt ggcgaccttt 
1^2 0 

qtctttatat att cttcg" qzz g^ teat a ctgtgaaacg tgctctctac aagceggteg 
10 3 0 

t f .-:aaaatac aaaat act ac aaatcaaatc aaatcataca ca :ata:a:a rcagtaaaaa 
1 1 4 3 

caaaacaaaa aac acataaa catggccaac a a gee gag ga a a t eg a aege catcgaatgg 

12 0 0 

gccacggcca ccggcacagt accgctcctg gaaaggagct gctgccacag cgaggacgcc 
12 60 
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gcactggagc cccaagcgag caaaaccagc catagagaac aagcccccat cctgcgccac 
1320 

ctgagccaac tgagccacct gctcatcatc gccggactgc tgatcgtctg cttggcgggc 
1380 

gtgacggagg gccgccggca tcgcccgctc atgttcgagg agtccgacac gggcaggcgg 
1440 

tccaaccgac cagcggtcac cgaatgccag tttggcaaag ttttgcgcga attggggtcc 
1500 

acctggtatg cggatttggg tccacccttc ggagttatgt actgcatcaa gtgtgaatgt 
1560 

gtggcgatac ccaagaagcg gcgcatcgtt gcacgcgtcc agtgtcgcaa tatcaaaaac 
1620 

gagtgcccgc cggccaaatg cgatgatccc atctcgttgc ccggaaaatg ctgcaagacc 
168 0 

tgtcccggcg atcgaaacga tacggatgta gccttggatg tgcccgtgcc caatgaagag 
1740 

gaagagcgca acatgaaaca ttaegctgcg t tggtaacgg gccgcacctc ctatttcctc 
13 0 0 

aagggtgagg aaatgaagtc catgtacacc acctacaatc cgcagaatgt ggtggccacc 
13 60 

gcccgtttcc tgttccacaa gaagaatcta tactactcct tciacacctc atcgcgaatc 
1 92 0 

ggtcgtccgc gtgccattca atccgttgat gatgcgggtg taatcetg ga ggagcatcaa 
1 9 8 0 

ctggagacca cc ttggcggg cactctcagt gtetatcaga atgccacggg caagatctgc 
2 0 4 0 

ggtgtctggc gacgagttcc acgtgattac aagegcatcc tggcggacga tgctctccat 
2 1 0 0 

attgtcctcc tctggggcaa caaacagcag gccgagttgg ctctggccgg aaaggtggcc 
2160 

aaatacacgg ccctgcagac ggagttgttc agttcgctac tggaggcacc acttcccgat 
222 0 

ggcaaaacgg atccccagct ggccggagcc ggtggcacag cgatcgtgtc caccagcagc 
2 2 8 0 

ggtgccgcct catcgatgca tctcaccctg gtct tcaatg gtgtctttgg tgccgaggag 
2 8 4 0 

tacgccga tg ca gca ctga g t gtgaaaatt gag ^tggcag aacggaagga ggt gaccttc 

2-1 8 0 

gatga gat t c cacgtgtgcg caaaccctct gccgagatca atgtcctgga gctgtcgteg 
2 4 6 0 

cccatttcca tacagaatct tcgactgatg tcgcgtggca aactcctgct gaccgtggag 
J 5 2 0 

tec a a gaagt acc ca :a:a: gcg catccag g gacacatcg tg a "cgagc cagctgegaa 

/; 5 :: 0 

atcttccaga cec tgc tggc gecgeacagt gecgaate zz cgac caag ag cageggt ttg 
2 64 0 

gcqt gggtct act:gaa:a: eg at ggatct zt g g z :ta-:a acatc gaaac ggagcacgtg 
: 7 i j 0 

aacacccggg ata g gc zc aa catcagtttg attga gga gc ag ggcaagc g gaaggccaag 
2760 
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ctggaggatc 
2820 

ggtcccaagg 
2880 

catgagacga 
2940 

tcggcggaac 
3000 

gtcggcatgg 
3060 

aacggtgtgc 
3120 

qgagcgccag 
3180 

ttcctcagca 
3240 

qtccat tcca 
3300 

ccqggtcact: 
3 360 

gataac catt 
3420 

t C Cf ■ J 3. g C 3 t t 

3480 

cacagt tgcg 
3 54 3 

caga gtga tg 
3 600 

qcgcaateet 
3660 

ctqqgcgagc 
3720 

ttegat ace: 
3 7 8 0 

ctcqtctgcc 
3 84 0 

tgctg ^aaga 
3 0 0 

aacaac cc "a 

3 9 6 0 

gaggctctgg 

4 0 2 0 

t. qgeatcega 
4 08 0 

tec a aggtga 
4 1 4 0 

e: gaceagca 
4 3 00 

ctctgctcca 
4360 



tgacgecgag 
tcctcgagtc 
gcctgatccg 
ccat tetget 
cctggatgtc 
ccgcccagga 
tgacgaggaa 
tgccatccgc 
ag cactc caa 
getteecegt 
tggtgaacgg 
ggcgeagtge 
aggt ca tea a 
gtgaa tgctg 
cgccagcca c 
agttcca tec 
geaecaectg 
egeegttgea 
tct gt ccgga 
atgtgctg ca 
ccaacggcgg 
tcctgatgtc 
actgzgat gc 
aa zq g cgt :: t 
cecactgccg 



cttcaacttc 
getgtatgee 
tggccgccta 
gaagcgacag 
cattgacaac 
tetgeagctg 
attgetcgag 
cgaactgatc 
acagct tctg 
ctataeggac 
agagacc aag 
c c a g g a 1 t c c 
gtgtccgcct 
tcccagctgt 
caatgccacc 
egecggtgee 
ca getgegat 
gtgeagega g 
g ggcaagca g 
g gatcagg :c 
a tgcaaggtg 
ccacggcgag 
caageget gc 
g 1 1 egaga aa 
ga gatcgagg 



aaccaggcca 
ggcgaactgg 
gtg ccccqtc 
gagcacaegg 
gagtgeaate 
tatctggagg 
gaatteaacg 
aagctggaga 
ctgcgcggca 
aacaatgt tc 
tgcttteact 
tgtcagatgt 
e tea agtgca 
gtgc ccaaga 
gatttget gc 
agttggcatc 
cccctgaccc 
a agetggect 
agcagt tcca 
atgeagegat 
geca acaagg 
caga agtgca 
t "cget -ca 
c-gga-gcag 
cgecaccaea 



ttggtagtgt 
gcgttaatgt 
cagtggccga 
atgcacagaa 
tgeactaega 
agaageccat 
gctcctatct 
tgagcgtctg 
a a ctgaagag 
ccgtgcctgg 
ccggacgct t 
gcgcctgttt 
agtccacgga 
aggaggeege 
aacagegacg 
ca ttcetgee 
tcgagattcg 
a t eg tc caga 
a t ggacacaa 
cgecgagtca 
tgtacga gaa 
tcaagtgccg 
cgtgccagea 
ctget ceggc 
agaggeagee 



ggagaagttg 
ggccaccgag 
tgetegggae 
tccacatgcc 
ggtgacgctc 
egaggegatt 
ggaaggcttc 
ctatctggag 
ca ecaaggtg 
cgaccacaat 
ctac aacgaa 
gcgtggccaa 
gc a actgett 
cgactattca 
cggctgcgcg 
gcccaatggc 
ctgteeccgg 
caa gaaggca 
ga eg a eg cog 
cagtgecgag 
eggeeaggag 
ctgea aggae 
gea gaeaege 
eategatgae 
geateateag 



cagcgatcct 
4320 

ccttggcggc 
4380 

tgtgcaccac 
4440 

catctacaca 
4500 

gttgcataga 
4560 

tatgaatcac 
4620 

ggaggaaaac 
4680 

ttggaaaagg 
4740 

attgaaagca 
4800 

cccctcccca 
4860 

atttattaca 
4892 



ccagctgagc 
aggggagcga 
ccaaacacac 
gacacacaga 
tcgttgttgc 
ggaggagaaa 
ggtagtaat a 
acattaaaac 
ttttctgaat 
accaacaaac 
aaacaaaaac 



ggctccacgt 
accaatcact 
acacacacac 
cagccacaaa 
tatcttatca 
acgtaggaga 
gagagagaga 
cagtgcagtt 
ttcttttcgc 
acccaaaaaa 
aaaaaaaaaa 



gacggatggg 
cactcactca 
acacacacaa 
agcgaacgcg 
tgtggcagca 
gaaatct cac 
gagagggaag 
tgcttt aaat 
agttacccca 
aaaaactaaa 
aa 



atcccaatcc 
ccaccactca 
ccacacaaca 
cacacagact 
atgagaactt 
aaaaaatata 
gagagagtct 
tctccagcgc 
cccgtgtaac 
aacattaaaa 



agtatcagat 
gtgtactcag 
ctcacaccca 
tgtgcaagga 
gtattatata 
tatatcttat 
aatgagatcc 
agaattttct 
ccaatcccct 
tacaatttta 
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<210> 12 
<211> 1038 
<212> PRT 

<213> Drosophila Short Gastrulation Gene (SOG) 
<400> 12 

Met Ala Asn Lys Leu Arg Lys Ser Asn Ala lie Glu Trp Ala Thr Ala 
15 10 15 

Thr Gly Thr Val Pro Leu Leu Glu Arg Ser Cys Cys His Ser Glu Asp 
20 25 30 

Ala Ala Leu Glu Pro Gin Ala Ser Lys Thr Ser His Arg Glu Gin Ala 
35 40 45 

Pro lie Leu Arg His Leu Ser Gin Leu Ser His Leu Leu lie lie Ala 
50 55 60 

Gly Leu Leu lie Val Cys Leu Ala Gly Val Thr Glu Gly Arg Arg His 
65 70 75 80 

Ala Pro Leu Met Phe Glu Glu Ser Asp Thr Gly Arg Arg Ser Asn Arg 

3 5 90 95 

Pro Ala Val Thr Glu Cys Gin Phe Gly Lys Val Leu Arg Glu Leu Gly 
100 105 110 

Ser Thr Trp Tyr Ala Asp Leu Gly Pro Pro Phe Gly Val Met Tyr Cys 
115 120 125 

lie Lys Cys Glu Cys Val Ala He Pro Lys Lys Arg Arg He Val Ala 
130 135 14 0 

Arg Val Gin Cys Arg Asn Lie Lys Asn Glu Cys Pre* Pro Ala Lys Cys 
145 150 155 160 

Asp Asp Pre- He Ser Leu Pro Gly Lys Cys Cys Lys Thr Cys Pro Gly 

165 170 175 

Asp Arg Asn Asp Thr Asp Val Ala Leu Asp Val Pre- Val Pro Asn Glu 
18 0 135 190 

Glu Glu Glu Arg Asn Met Lys His Tyr Ala Ala Leu Leu Thr Gly Arg 
1 95 2 00 20 5 

Thr Ser Tyr Phe Leu Lys Gly Glu Glu Met Lys Ser Met Tyr Thr Thr 

210 215 220 

Tyr Asn Pro Hn Asn Val Val Ala Thr Ala Arg Phe Leu Phe His Lys 
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225 230 235 240 

Lys Asn Leu Tyr Tyr Ser Phe Tyr Thr Ser Ser Arg lie Gly Arg Pro 

245 250 255 

Arg Ala He Gin Phe Val Asp Asp Ala Gly Asx He Leu Glu Glu His 
260 265 270 

Gin Leu Glu Thr Thr Leu Ala Gly Thr Leu Ser Val Tyr Gin Asn Ala 
275 280 235 

Thr Gly Lys He Cys Gly Val Trp Arg Arg Val Pro Arg Asp Tyr Lys 
290 295 300 

Arg He Leu Arg Asp Asp Arq Leu His Val Val Leu Leu Trp Gly Asn 
305 310 ~ 315 320 

Lys Gin Gin Ala Glu Leu Ala Leu Ala Gly Lys Val ALa Lys Tyr Thr 

32 5 330 335 

Ala Leu Gin Thr Glu Leu Phe Ser Ser Leu Leu Glu Ala Pro Leu Pro 
340 345 350 

Asp Gly Lys Thr Asp Pro Gin Leu Ala Gly Ala Gly Gly Thr Ala He 
355 360 365 

Val Ser Thr Ser Ser Gly Ala Ala Ser Ser Met His Leu Thr Leu Val 
370 375 380 

Phe Asn Gly Val Phe Gly Ala Glu Glu Tyr Ala Asp Ala Ala Leu Ser 
385 390 395 400 

Val Lys lie Glu Leu Ala Glu Arg Lys Glu Val He Phe Asp Glu He 

4 0 5 4 1 0 415 

Pro Arg Val Arg Lys Pro Ser Ala Glu He Asn Val Leu Glu Leu Ser 
420 415 430 

Ser Pro lie Ser He Gin Asn Leu Arg Leu Met Ser Arg Gly Lys Leu 
4 35 44 0 44 5 

Leu Leu Thr Val Glu Ser Lys Lys Tyr Pro His Leu Arg He Gin Gly 
4 50 455 4 60 

His He Val Thr Arq Ala Ser Cys Glu lie Phe Gin Thr Leu Leu Ala 
46: ^470 475 48 0 

Pro His Ser Ala H j Ser Ser Thr Lys Ser Ser Gly Leu Ala Trp Val 

4 8 5 4 9 J 4 9 5 
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Tyr Leu Asn Thr Asp Gly Ser Leu Ala Tyr Asn lie Glu Thr Glu His 
500 505 510 

Val Asn Thr Arg Asp Arq Pro Asn lie Ser Leu lie Glu Glu Gin Gly 
515 " 520 525 

Lys Arg Lys Ala Lys Leu Glu Asp Leu Thr Pro Ser Phe Asn Phe Asn 
530 535 540 

Gin Ala lie Gly Ser Val Glu Lys Leu Gly Pro Lys Val Leu Glu Ser 
545 550 555 560 

Leu Tyr Ala Gly Glu Leu Gly Val Asn Val Ala Thr Glu His Glu Thr 

565 570 575 

Ser Leu lie Arg Gly Arg Leu Val Pro Arq Pro Val Ala Asp Ala Arg 
580 585 " 590 

Asp Ser Ala Glu Pro lie Leu Leu Lys Arg Gin Glu His Thr Asp Ala 
59 5 60 0 60 5 

Gin Asn Pro His Ala Val Gly Met Ala Trp Met Ser He Asp Asn Glu 
610 615 62 0 

Cys Asn Leu His Tyr Glu Val Thr Leu Asn Gly Val Pro Ala Gin Asp 
62 5 6 30 63 5 64 0 

Leu Gin Leu Tyr Leu Glu Glu Lys Pro Ho Glu Ala He Gly Ala Pro 

645 650 655 

Val Thr Arg Lys Lou Leu Glu Glu Phe Asn Gly Ser Tyr Leu Glu Gly 
660 665 670 

Phe Phe Leu Ser Met Pro Ser Ala Glu Leu He Lys Leu Glu Met Ser 
67 5 68 0 68 5 

Val Cys Tyr Leu Glu Val His Ser Lys Met Ser Lys Cys Leu Leu Leu 
6 90 6 95 7 00 

Arg Gly Lys Leu Lys Ser Thr Lys Val Pro Gly His Cys Phe Pro Val 
705 710 715 720 

Tyr Thr Asp Asn Asn Val Pro Val Pro Gly Asp His Asn Asp Asn His 

725 730 735 

Leu Val Asn Gly Glu Thr Lys Cys Phe His Ser Gly Arg Phe Tyr Asn 
74 0 ' 74 5 7 50 

Glu Ser Glu Gin Trp Arg Ser Ala Gin Asp Ser Cys Gin Met Cys Ala 

755 " 760 765 
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Cys Leu Arg Gly Gin Ser Ser Cys Glu Val He Lys Cys Pro Ala Leu 
770 775 780 

Lys Cys Lys Ser Thr Glu Gin Leu Leu Gin Arg Asp Gly Glu Cys Cys 
785 790 795 800 

Pro Ser Cys Val Pro Lys Lys Glu Ala Ala Asp Tyr Ser Ala Gin Ser 

805 810 815 

Ser Pro Ala Thr Asn Ala Thr Asp Leu Leu Gin Gin Arg Arg Gly Cys 
820 825 830 

Arg Leu Gly Glu Gin Phe His Pro Ala Gly Ala Ser Trp His Pro Phe 
835 840 845 

Leu Pro Pro Asn Gly Phe Asp Thr Cys Thr Thr Cys Ser Cys Asp Pro 
850 855 860 

Leu Thr Leu Glu He Arg Cys Pro Arg Leu Val Cys Pro Pro Leu Gin 
865 870 875 880 

Cys Ser Glu Lys Leu Ala Tyr Pro Pro Asp Lys Lys Ala Cys Cys Lys 

385 890 395 

He Cys Pro Glu Gly Tyr Gin Ser Ser Ser Asn Gly His Lys Thr Thr 
90 0 ="0 5 91 0 

Pro Asn Asn Pro Asn Val Leu Gin Asp Gin Ala Met Gin Arg Ser Pro 
9 1 5 9 2 0 92 5 

Ser His Ser Ala Glu Glu Val Leu Ala Asn Gly Gly Cys Lys Val Val 
930 935 94 0 

Asn Lys Val Tyr Glu Asn Gly Gin Glu Trp His Pro lie Leu Met Ser 
94 5 " 950 955 960 

His Gly Glu Gin Lys Cys lie Lys Cys Arg Cys Lys Asp Ser Lys Val 

965 970 975 

Asn Cys Asp Ala Lys Arg Cys Ser Arg Ser Thr Cys Gin Gin Gin Thr 
98 0 ' 98 5 9 90 

Arg Val Thr Ser Lys Arg Arg Leu Phe Glu Lys Pro Asp Ala Ala Ala 

9 9 5 1 0 C 0 1 0 0 5 

Pro Ala lie Asp Liu Phe Lys Ser Thr Gin Cys Arg Arg Ser Arg Arg 
1010 1 0 15 1 0 2 0 

His His Lys Arq Gin Pro His His Gin Gin Arg Ser Ser Ser 
1025 " 1030 1035 
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